Patent Screen 



Your Fasta Input (protein sequence) : 

>CL001305 / SEQ ID NO:2, March 3, 2004 

1 MAFARRLLRG PLSGPLLGRR GVCAGAMAPP CRFVLELPDC TLAHFALGAD 
51 APGDADAPDP RLAALLGPPE RSYSLCVPVT PDAGCGARVR AARLHQRLLH 
101 QLRRGPFQRC QLLRLLCYCP GGQAGGAQQG FLLRDPLDDP DTRQALLELL 
151 GACQEAPRPH LGEFEADPRG QLWQRLWEVQ DGRRLQVGCA QWPVPEPPL 

2 01 HPWPDLPSS WFPDREAAR AVLEECTSFI PEARAVLDLV DQCPKQIQKG 
251 KFQWAIEGL DATGKTTVTQ SVADSLKAVL LKSPPSCIGQ WRKIFDDEPT 

3 01 IIRRAFYSLG NYIVASEIAK ESAKSPVIVD RYWHSTATYA IATEVSGGLQ 
351 HLPPAHHPVY QWPEDLLKPD LILLLTVSPE ERLQRLQGRG MEKTREEAEL 

4 01 EANSVFRQKV EMSYQRMENP GCHWDASPS REKVLQTVLS LIQNSFSEP 



FastAlertP Blast Report: 
BLASTP 2.2.5 [Nov-16-2002 j 

Reference: Altschul, Stephen F. , Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PS I -BLAST : a new generation of protein database search 
programs", Nucleic Acids Res. 25:3389-3402. 

Query= CL001305, SEQ ID NO: 2, March 3, 2004 
(449 letters) 

Database: FastAlert_P . fasta 

221,820 sequences; 37,667,424 total letters 

Score E 

Sequences producing significant alignments: (bits) Vali 

CRA|222000011464779 /dataset=FastAlert_P /length=449 /altid=Derw. . . 909 

CRA|224000047885450 /dataset=FastAlert_P /length=422 /altid=Derw. . . 842 

CRA|224000047885452 /dataset=FastAlert_P /length=251 /altid=Derw. . . 455 

CRA| 179000322886965 /dataset=FastAlert_P /length=200 /altid=Derw. . . 382 

CRA| 179000322886965 /dataset=FastAlert_P /length=200 /altid=Derw. . . 382 

>CRA| 222000011464779 /dataset=FastAlert_P /length=449 

/altid=Derwent | WO2004009778 . 17 /def =/PT= "New human 
kinases and phosphatases, useful for diagnosing, 
treating or preventing atherosclerosis, hypertension, 
AIDS, allergy, multiple sclerosis, osteoarthritis, 
Alzheimer's disease, Crohn's disease, cancer or 
hepatitis" /QU="DPT" /PA="INCYTE CORP" /PD="29- JAN-2004 " 
/PR=" 19- JUL- 2002" /ED="19-FEB-2004" 
Length = 449 

Score = 909 bits (2350), Expect = 0.0 

Identities = 448/449 (99%), Positives = 448/449 (99%) 



Query : 


1 


MAFARRLLRGPLSGPLLGRRGVCAGAMAPPCRFVLELPDCTLAHFALGADAPGDADAPDP 
MAFARRLLRGPLSGPLLGRRGVCAGAMAPP RFVLELPDCTLAHFALGADAPGDADAPDP 


60 


Sbjct : 


1 


MAFARRLLRGPLSGPLLGRRGVCAGAMAPPRRFVLELPDCTLAHFALGAD APGDADAPDP 


60 


Query : 


61 


RLAALLGPPERSYSLCVPVTPDAGCGARVRAARLHQRLLHQLRRGPFQRCQLLRLLCYCP 
RLAALLGPPERSYSLCVPVTPDAGCGARVRAARLHQRLLHQLRRGPFQRCQLLRLLCYCP 


120 


Sbjct: 


61 


RLAALLGPPERSYSLCVPVTPDAGCGARVRAARLHQRLLHQLRRGPFQRCQLLRLLCYCP 


120 


Query: 


121 


GGQAGGAQQGFLLRDPLDDPDTRQALLELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQ 


180 



1 



GGQAGGAQQGFLLRDPLDDPDTRQALLELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQ 
Sbjct: 121 GGQAGGAQQGFLLRDPLDDPDTRQALLELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQ 180 



Query: 181 DGRRLQVGCAQWPVPEPPLHPWPDLPSSWFPDREAARAVLEECTSFIPEARAVLDLV 24 0 

DGRRLQVGCAQWPVPEPPLHPWPDLPSSWFPDREAARAVLEECTSFIPEARAVLDLV 
Sbjct: 181 DGRRLQVGCAQWPVPEPPLHPWPDLPSSWFPDREAARAVLEECTSFIPEARAVLDLV 240 

Query: 241 DQCPKQIQKGKFQWAIEGLDATGKTTVTQSVADSLKAVLLKSPPSCIGQWRKIFDDEPT 300 

DQC PKQ I QKGKFQWAI EGLDATGKTTVTQS VADS LKAVLLKS P PS C I GQWRKI FDDEPT 
Sbjct: 241 DQCPKQIQKGKFQVVAIEGLDATGKTTVTQSVADSLKAVLLKSPPSCIGQWRKI FDDEPT 300 

Query: 3 01 IIRRAFYSLGNYIVASEIAKESAKSPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVY 360 

IIRRAFYSLGNYIVASEIAKESAKSPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVY 
Sbjct: 301 IIRRAFYSLGNYIVASEIAKESAKSPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVY 360 

Query: 361 QWPEDLLKPDLI LLLTVS PEERLQRLQGRGMEKTREEAELEANSVFRQKVEMS YQRMENP 420 

QWPEDLLKPDLILLLWSPEERLQRLQGRGMEKTREEAELEANSVFRQKVEMSYQRMENP 
Sbjct: 361 QWPEDLLKPDLI LLLTVS PEERLQRLQGRGMEKTREEAELEANSVFRQKVEMS YQRMENP 420 

Query: 421 GCHWDAS PS RE KVLQTVLSL I QNS FSE P 449 

GCHWDASPSREKVLQTVLSLIQNSFSEP 
Sbjct: 421 GCHWDASPSREKVLQTVLSLIQNSFSEP 449 



>CRA| 224000047885450 /dataset=FastAlert_P /length=422 

/altid=Derwent | WO2003102159 . 1112 /def =/PT= "New NOVX 
polypeptides and nucleic acid molecules useful for 
preventing or treating NOVX-associated disorders, e.g. 
cancer, cardiomyopathy, atherosclerosis or diabetes, in 
chromosome mapping, tissue typing or in 
pharmacogenetics" /QU="DPT" / PA= " CURAGEN CORP" 
/PD="ll-DEC-2003" /PR="04-JUN-2002" /ED= " 25 -DEC- 2003 " 
Length = 422 

Score = 842 bits (2174), Expect = 0.0 

Identities = 420/424 (99%), Positives = 420/424 (99%), Gaps = 3/424 (0%) 



Query : 


27 


MAPPCRFVLELPDCTLAHFALGADAPGDADAPDPRLAALLGPPERSYSLCVPVTPDAGCG 


86 






MAPP RFVLELPDCTLAHFALGADAPGDADAPDPRLAALLGPPERSYSLCVPVTPDAGCG 




Sbjct: 


1 


MAPPRRFVLELPDCTLAHFALGADAPGDADAPDPRLAALLGPPERSYSLCVPVTPDAGCG 


60 


Query: 


87 


ARVRAARLHQRLLHQLRRGPFQRCQLLRLLCYCPGGQAGGAQQGFLLRDPLDDPDTRQAL 


146 






ARVRAARLHQRLLHQLRRGPFQRCQLLRLLCYCPGGQAGGAQQGFLLRDPLDDPDTRQAL 




Sbjct: 


61 


ARVRAARLHQRLLHQLRRGPFQRCQLLRLLCYCPGGQAGGAQQGFLLRDPLDDPDTRQAL 


120 


Query : 


147 


LELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQDGRRLQVGCAQWPVPEPPLHPWPD 


206 






LELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQDGRRLQVGCAQWPVPEPPLHPWPD 




Sbjct : 


121 


LELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQDGRRLQVGCAQWPVPEPPLHPWPD 


180 


Query : 


207 


LPSSWFPDREAARAVLEECTSFIPEARAVLDLVDQCPKQIQKGKFQWAIEGLDAT-GK 


265 






LPSSWFPDREAARAVLEECTSFIPEARAVLDLVDQCPKQIQKGKFQWAIEGLDAT GK 




Sbjct : 


181 


LPSSWFPDREAARAVLEECTSFIPEARAVLDLVDQCPKQIQKGKFQWAIEGLDATGGK 


240 


Query: 


266 


TTVTQSVADSLKAVLLKSPPSCIGQWRKIFDDEPTIIRRAFYSLGNYIVASEIAKESAKS 


325 






TTVTQSVADSLKAVLLKSPPSCIGQWRKIFDDEPTIIRRAFYSLGNYIVASEIAKESAKS 




Sbjct: 


241 


TTVTQS VADS LKAVLL KS P P S C I GQWRKI FDDE PT 1 1 RRAF YS LGN Y I VAS E I AKE SAKS 


300 


Query : 


326 


PVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVYQWPEDLLKPDLI LLLTVS PEERLQR 


385 






PVIVDR HSTATYAIATEVSGGLQHLPPAHHPVYQWPEDLLKPDLI LLLTVS PEERLQR 




Sbjct : 


301 


PVI VDR - - HSTATYAIATEVSGGLQHLPPAHHPVYQWPEDLLKPDL I LLLTVS PEERLQR 


358 


Query : 


386 


LQGRGMEKTREEAELEANSVFRQKVEMSYQRMENPGCHWDASPSREKVLQTVLSLIQNS 


445 






LQGRGMEKTREEAELEANSVFRQKVEMSYQRMENPGCHVVDASPSREKVLQTVLSLIQNS 





2 



Sbjct: 359 LQGRGMEKTREEAELEANSVFRQKVEMSYQRMENPGCHWDASPSREKVLQTVLSLIQNS 418 



Query: 446 FSEP 449 
FSEP 

Sbjct: 419 FSEP 422 



>CRA| 224000047885452 /dataset=FastAlert_P /length=251 

/altid=Derwent | WO2003102159 . 1118 /def =/PT= "New NOVX 
polypeptides and nucleic acid molecules useful for 
preventing or treating NOVX-associated disorders, e.g. 
cancer, cardiomyopathy, atherosclerosis or diabetes, in 
chromosome mapping, tissue typing or in 
pharmacogenomics" /QU="DPT" / PA= " CURAGEN CORP" 
/PD="ll-DEC-2003" /PR="04-JUN-2002" /ED="25-DEC-2003 " 
Length = 251 

Score = 455 bits (1171), Expect = e-128 

Identities = 229/232 (98%), Positives = 229/232 (98%) 

Query: 218 AARAVLEECTS F I PEARAVLDLVDQCPKQ I QKGKFQWAI EGLDATGKTTVTQS VADS LK 277 

A AVLEECTSFIPEARAVLDLVDQCPKQIQKGKFQWAI EGLDATGKTTVTQS ADSLK 
Sbjct: 20 ALGAVLEECTSF I PEARAVLDLTOQCPKQ I QKGKFQWAI EGLDATGKTTVTQS AADSLK 79 

Query: 278 AVLLKSPPSCIGQWRKIFDDEPTI IRRAFYSLGNYI VASE I AKESAKSPVIVDRYWHSTA 33 7 

AVLLKSPPSCIGQWRKIFDDEPTI IRRAFYSLGNYIVASEIAKESAKSPVIVDRYWHSTA 
Sbjct : 80 AVLLKSPPSCIGQWRKIFDDEPTIIRRAFYSLGNYIVASEIAKESAKSPVIVDRYWHSTA 139 

Query: 338 TYAIATEVSGGLQHLPPAHHPVYQWPEDLLKPDLILLLTVSPEERLQRLQGRGMEKTREE 397 

TYAIATEVSGGLQHLPPAHHPVYQWPEDLLKPDLILLLTVSPEERLQRLQGRGMEKTREE 
Sbjct: 140 TYAIATEVSGGLQHLPPAHHPVYQWPEDLLKPDLILLLTVSPEERLQRLQGRGMEKTREE 199 

Query: 398 AELEANSVFRQKVEMS YQRMENPGCHVVDAS PSREKVLQTVLSLI QNS FSEP 449 

AELEANSVFRQKVEMS YQRMENPGCHWD AS PSREKVLQTVLSLI QNS FSEP 
Sbjct: 200 AELEANSVFRQKVEMS YQRMENPGCHVVDAS PSREKVLQTVLSLI QNS FSEP 251 



>CRA| 179000322886965 /dataset=FastAlert_P /length=200 

/altid=Derwent | WO2003102159 . 1116 /def =/PT="New NOVX 
polypeptides and nucleic acid molecules useful for 
preventing or treating NOVX-associated disorders, e.g. 
cancer, cardiomyopathy, atherosclerosis or diabetes, in 
chromosome mapping, tissue typing or in 
pharmacogenomics" /QU="DPT" / PA= 11 CURAGEN CORP" 
/PD="ll-DEC-2003" /PR="04-JUN-2002" /ED= "25 -DEC- 2003 " 
Length = 200 

Score = 382 bits (981), Expect = e-106 

Identities = 193/195 (98%), Positives = 194/195 (99%), Gaps = 1/195 (0%) 



Query: 
Sbjct: 
Query: 
Sbjct: 
Query: 
Sbjct: 
Query : 



256 AI EGLDATG - KTTVTQS VADS LKAVLLKS P P S C I GQWRKI FDDE PT 1 1 RRAF YS LGN Y I V 314 

+ 1 EGLDATG KTTVTQS VADSLKAVLLKS PPS C I GQWRKI FDDE PTI I RRAFYS LGNY I V 
3 S I EGLDATGGKTTVTQS VADSLKAVLLKS PPS C I GQWRKI FDDE PT 1 1 RRAFYS LGNY IV 62 

315 ASEIAKESAKSPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVYQWPEDLLKPDLILL 374 

ASEIAKESAKSPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVYQWPEDLLKPDLILL 
63 ASEIAKESAKSPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVYQWPEDLLKPDLILL 122 

375 LTVSPEERLQRLQGRGMEKTREEAELEANSVFRQKVEMS YQRMENPGCHVVDAS PSREKV 4 34 

LTVSPEERLQRLQGRGMEKTREEAELEANSVFRQKVEMS YQRMENPGCHVVDAS PSREKV 
123 LTVSPEERLQRLQGRGMEKTREEAELEANSVFRQKVEMS YQRMENPGCHVVDAS PSREKV 182 

435 LQTVLS L I QNS FSEP 449 



3 



LQTVLS L I QNS FS E P 
Sbjct: 183 LQTVLS LI QNS FSEP 197 



>CRA| 179000322886965 /dataset=FastAlert_P /length=200 

/altid=Derwent | WO2003102159 . 1114 /def =/PT= "New NOVX 
polypeptides and nucleic acid molecules useful for 
preventing or treating NOVX-associated disorders, e.g. 
cancer, cardiomyopathy, atherosclerosis or diabetes, in 
chromosome mapping, tissue typing or in 
pharmacogenomics" /QU="DPT" / PA= " CURAGEN CORP" 
/PD="ll-DEC-2003" /PR="04-JUN-2002" /ED= " 2 5 -DEC- 2 003 " 
Length = 200 

Score = 382 bits (981), Expect = e-106 

Identities = 193/195 (98%), Positives = 194/195 (99%), Gaps = 1/195 (0%) 



Query: 


256 


AI EGLDATG - KTTVTQS VADS LKAVLLKS P PSC I GQWRKI FDDE PT 1 1 RRAF YSLGNY I V 


314 






+ 1 EGLDATG KTTVTQS VADS LKAVLLKS P PSC I GQWRKI FDDE PT 1 1 RRAF YSLGNY IV 




Sbjct : 


3 


S I EGLDATGGKTTVTQS VADS LKAVLLKS PPSC I GQWRKI FDDE PT 1 1 RRAF YSLGNY IV 


62 


Query: 


315 


ASEIAKESAKSPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVYQWPEDLLKPDLILL 


374 






ASEIAKESAKS PVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVYQWPEDLLKPDLILL 




Sbjct: 


63 


ASEIAKESAKS PVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVYQWPEDLLKPDLILL 


122 


Query : 


375 


LTVSPEERLQRLQGRGMEKTREEAELEANSVFRQKVEMSYQRMENPGCHVVDASPSREKV 


434 






LTVS PEERLQRLQGRGMEKTREEAELEANSVFRQKVEMS YQRMENPGCHVVDAS PSREKV 




Sbjct: 


123 


LTVSPEERLQRLQGRGMEKTREEAELEANSVFRQKVEMSYQRMENPGCHVVDASPSREKV 


182 


Query: 


435 


LQTVLSLIQNSFSEP 44 9 








LQTVLSLIQNSFSEP 




Sbjct: 


183 


LQTVLSLIQNSFSEP 197 





Database: FastAlert_P . f asta 

Posted date: Mar 2, 2004 5:57 AM 
Number of letters in database: 37,667,424 
Number of sequences in database: 221,820 

Lambda K H 

0.321 0.138 0.426 



Gapped 

Lambda K H 

0.267 0.0410 0.140 



Matrix: BLOSUM62 

Gap Penalties: Existence: 11, Extension: 1 

Number of Hits to DB: 39,731,498 

Number of Sequences: 221820 

Number of extensions: 1943285 

Number of successful extensions: 4818 

Number of sequences better than l.0e-08: 5 

Number of HSP's better than 0.0 without gapping: 5 

Number of HSP's successfully gapped in prelim test: 0 

Number of HSP's that attempted gapping in prelim test: 4808 

Number of HSP's gapped (non-prelim): 6 

length of query: 44 9 

length of database: 37,667,424 

effective HSP length: 106 

effective length of query: 343 

effective length of database: 14,154,504 



4 



effective search space: 4854994872 
effective search space used: 4854994872 



T: 


11 






A: 


40 






XI 


16 


( 7.4 


bits) 


X2 


38 


(14 .6 


bits) 


X3 


64 


(24 .7 


bits) 


SI 


41 


(21.8 


bits) 


S2 


141 


(58. 


9 bits) 



GENESEQP Blast Report: 
BLASTP 2.2.5 [Nov- 16- 2002] 

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997) , 
"Gapped BLAST and PS I -BLAST : a new generation of protein database search 
programs", Nucleic Acids Res. 25:3389-3402. 

Query= CL001305, SEQ ID NO:2, March 3, 2004 
(449 letters) 

Database: geneseqp_all . f asta 

1,546,707 sequences; 273,213,384 total letters 



Score 

Sequences producing significant alignments: 

CRA| 224000046444906 /dataset=GENESEQP /org=Homo sapiens /taxon=9. 
CRA| 224000046444906 /dataset=GENESEQP /org=Homo sapiens /taxon=9. 
CRA| 222000011464779 /dataset=GENESEQP /org=Homo sapiens /taxon=9. 
CRA 1 224000047885450 /dataset=GENESEQP /org=Homo sapiens /taxon=9. 
CRA| 98000043611837 /dataset=GENESEQP /org=Mus musculus /taxon=10. 
CRA| 224000047885452 /dataset=GENESEQP /org=Homo sapiens /taxon=9. 
CRA| 223000060714134 /dataset =GENESEQP /org=Homo sapiens /taxon=9. 
CRA| 114000021936825 /dataset=GENESEQP /org=Mus musculus /taxon=l. 
CRA| 224000046444908 /dataset=GENESEQP /org=Mus musculus /taxon=l. 

>CRA| 224000046444906 /dataset=GENESEQP /org=Homo sapiens /taxon=9606 
/mol_type=aa /date=18-DEC-03 /length=449 
/altid=derwent_ac | ADC63370 /altid=derwent_id | ADC63370 
/def=Human novel kinase protein /patent=US2003087294 -Al 
/pat__section=Claim 
Length = 44 9 

Score = 914 bits (2362), Expect = 0.0 

Identities = 449/449 (100%), Positives = 449/449 (100%) 



(bits) Value 


914 


0.0 


914 


0.0 


909 


0.0 


842 


0.0 


602, 


e-171 


455 


e-127 


441 


e-123 


259 


2e-68 


199 


3e-50 



Query: 1 MAFARRLLRGPLSGPLLGRRGVCAGAMAPPCRFVLELPDCTLAHFALGADAPGDADAPDP 60 

MAFARRLLRGPLSGPLLGRRGVCAGAMAPPCRFVLELPDCTLAHFALGADAPGDADAPDP 
Sbjct: 1 MAFARRLLRGPLSGPLLGRRGVCAGAMAPPCRFVLELPDCTLAHFALGADAPGDADAPDP 60 

Query: 61 RLAALLGPPERSYSLCVPVTPDAGCGARVRAARLHQRLLHQLRRGPFQRCQLLRLLCYCP 120 

RLAALLGPPERSYSLCVPVTPDAGCGARVRAARLHQRLLHQLRRGPFQRCQLLRLLCYCP 
Sbjct: 61 RLAALLGPPERSYSLCVPVTPDAGCGARVRAARLHQRLLHQLRRGPFQRCQLLRLLCYCP 120 

Query: 121 GGQAGGAQQGFLLRDPLDDPDTRQALLELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQ 180 

GGQAGGAQQGFLLRDPLDDPDTRQALLELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQ 
Sbjct: 121 GGQAGGAQQGFLLRDPLDDPDTRQALLELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQ 180 

Query: 181 DGRRLQVGCAQWPVPEPPLHPWPDLPSSWFPDREAARAVLEECTSFIPEARAVLDLV 24 0 

DGRRLQVGCAQWPVPEPPLHPWPDLPSSWFPDREAARAVLEECTSFIPEARAVLDLV 
Sbjct: 181 DGRRLQVGCAQWPVPEPPLHPWPDLPSSWFPDREAARAVLEECTSFIPEARAVLDLV 240 
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Query: 241 DQC PKQ I QKGKFQWAI EGLD ATGKTTVTQSVADS LKAVLLKS P PSC I GQWRKI FDDE PT 300 

DQCPKQIQKGKFQWAIEGLDATGKTTVTQSVADSLKAVLLKSPPSCIGQWRKIFDDEPT 
Sbjct: 241 DQCPKQIQKGKFQWAIEGLDATGKTTVTQSVADSLKAVLLKSPPSCIGQWRKIFDDEPT 300 

Query: 301 IIRRAFYSLGNYIVASEIAKESAKS PVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVY 360 

IIRRAFYSLGNYIVASEIAKESAKS PVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVY 
Sbjct: 301 IIRRAFYSLGNYIVASEIAKESAKS PVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVY 360 

Query: 361 QWPEDLLKPDLI LLLTVS PEERLQRLQGRGMEKTREEAELEANSVFRQKVEMS YQRMENP 420 

QWPEDLLKPDLILLLTVSPEERLQRLQGRGMEKTREEAELEANSVFRQKVEMSYQRMENP 
Sbjct: 361 QWPEDLLKPDLI LLLTVS PEERLQRLQGRGMEKTREEAELEANSVFRQKVEMS YQRMENP 420 

Query: 421 GCHWDAS PSREKVLQTVLS L I QNS FS E P 449 

GCHWDASPSREKVLQTVLSLIQNSFSEP 
Sbjct: 4 21 GCHWDASPSREKVLQTVLSLIQNSFSEP 44 9 



>CRA| 224000046444906 /dataset=GENESEQP /org=Homo sapiens /taxon=9606 
/mol_type=aa /date=10- JUN-03 /length=449 
/altid=derwent_ac | ABU08678 /altid=derwent_id | ABU08678 
/def=Novel human thymidylate kinase subfamily kinase 
/patent=US64 89153-Bl /pat_sect ion=Claim 
Length = 44 9 

Score = 914 bits (2362), Expect = 0.0 

Identities = 449/449 (100%), Positives = 449/449 (100%) 



Query : 


1 


MAFARRLLRGPLSGPLLGRRGVCAGAMAPPCRFVLELPDCTLAHFALGADAPGDADAPDP 


60 






MAFARRLLRGPLSGPLLGRRGVCAGAMAPPCRFVLELPDCTLAHFALGADAPGDADAPDP 




Sbjct : 


1 


MAFARRLLRGPLSGPLLGRRGVCAGAMAPPCRFVLELPDCTLAHFALGADAPGDADAPDP 


60 


Query: 


61 


RLAALLGPPERSYSLCVPVTPDAGCGARVRAARLHQRLLHQLRRGPFQRCQLLRLLCYCP 


120 






RLAALLGPPERSYSLCVPVTPDAGCGARVRAARLHQRLLHQLRRGPFQRCQLLRLLCYCP 




Sbjct : 


61 


RLAALLGPPERSYSLCVPVTPDAGCGARVRAARLHQRLLHQLRRGPFQRCQLLRLLCYCP 


120 


Query: 


121 


GGQAGGAQQGFLLRDPLDDPDTRQALLELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQ 


180 






GGQAGGAQQGFLLRDPLDDPDTRQALLELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQ 




Sbjct : 


121 


GGQAGGAQQGFLLRDPLDDPDTRQALLELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQ 


180 


Query : 


181 


DGRRLQVGCAQWPVPEPPLHPWPDLPSSWFPDREAARAVLEECTSFIPEARAVLDLV 


240 






DGRRLQVGCAQWPVPEPPLHPWPDLPSSWFPDREAARAVLEECTSFIPEARAVLDLV 




SbjCt : 


181 


DGRRLQVGCAQWPVPEPPLHPWPDLPSSWFPDREAARAVLEECTSFIPEARAVLDLV 


240 


Query : 


241 


DQCPKQI QKGKFQWAI EGLDATGKTTVTQSVADSLKAVLLKSP PS CI GQWRKI FDDE PT 


300 






DQCPKQIQKGKFQWAIEGLDATGKTTVTQSVADSLKAVLLKSPPSCIGQWRKIFDDEPT 




Sbjct : 


241 


DQCPKQIQKGKFQWAIEGLDATGKTTVTQSVADSLKAVLLKSPPSCI GQWRKI FDDEPT 


300 


Query: 


301 


IIRRAFYSLGNYIVASEIAKESAKSPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVY 


360 






IIRRAFYSLGNYIVASEIAKESAKSPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVY 




SbjCt: 


301 


IIRRAFYSLGNYIVASEIAKESAKSPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVY 


360 


Query: 


361 


QWPEDLLKPDL I LLLTVS PEERLQRLQGRGMEKTREEAE LEANS VFRQKVEMS YQRMENP 


420 






QWPEDLLKPDLI LLLTVSPEERLQRLQGRGMEKTREEAELEANSVFRQKVEMS YQRMENP 




Sbjct: 


361 


QWPEDLLKPDLI LLLTVS PEERLQRLQGRGMEKTREEAELEANS VFRQKVEMS YQRMENP 


420 


Query: 


421 


GCHWDASPSREKVLQTVLSLIQNSFSEP 449 








GCHWDASPSREKVLQTVLSLIQNSFSEP 




Sbjct: 


421 


GCHWDASPSREKVLQTVLSLIQNSFSEP 44 9 





>CRA| 222000011464779 /dataset=GENESEQP /org=Homo sapiens /taxon=9606 
/mol_type=aa /date=04 -DEC- 03 /length=44 9 
/altid=derwent_ac | AAE38613 /altid=derwent_id | AAE38613 



6 



/def=Human thymidylate kinase protein 
/patent=WO200306464 2-Al /pat_section=Claim 
Length = 449 

Score = 909 bits (2350), Expect = 0.0 

Identities = 448/449 (99%), Positives = 448/449 (99%) 



Query : 


1 


MAFARRLLRGPLSGPLLGRRGVCAGAMAPPCRFVLELPDCTLAHFALGADAPGDADAPDP 


60 






M A IT A DD T T TPf2DT CPDT T PDD^yfanaMflDD DTn/T PT DT^r^T'T AUfAT H RnaDfn 7vn a nnn 
rl/\r /\XK1u1jK^F1jo^F1j1j^K.KO V v^M.o/^i v L/\r^ F Kr V LihjLtk'lJ^ 1 Ltrtxlr /U^Li/\JJ AFoU A±J A.FD F 




Sbjct : 


1 


MAFARRLLRGPLSGPLLGRRGVCAGAMAPPRRFVLELPDCTLAHFALGADAPGDADAPDP 


60 


Query: 


61 


RLAALLGPPERSYSLCVPVTPDAGCGARVRAARLHQRLLHQLRRGPFQRCQLLRLLCYCP 


120 










Sbjct: 


61 


RLAALLGPPERSYSLCVPVTPDAGCGARVRAARLHQRLLHQLRRGPFQRCQLLRLLCYCP 


120 


Query: 


121 


GGQAGGAQQGFLLRDPLDDPDTRQALLELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQ 


180 






cnr\'h(~ % r*'h.r\r\c , 'CT t DnDT TiTt'DTiT'D/'^RT t t?t t pappcaddtdut pupparddpat madt i.tdt t/~\ 
^^SJHKjKjJV^KjKjt JjJjKiJFljJJiJFiJiKyAljIjnJjijljAUyiiAFKFnJj^lir b/yJFK^yijWyKljVVbVy 




Sbjct: 


121 


GGQAGGAQQGFLLRDPLDDPDTRQALLELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQ 


180 


Query: 


181 


DGRRLQVGCAQWPVPEPPLHPWPDLPSSWFPDREAARAVLEECTSFIPEARAVLDLV 


240 






LHjKKIjU VbLAy V VF VPbFFijriF V VFULiFbo V Vr PUKbAAKAV LibhC 1 br I PhARAVLDLV 




Sbjct : 


181 


DGRRLQVGCAQWPVPEPPLHPWPDLPSSWFPDREAARAVLEECTSFIPEARAVLDLV 


240 


Query : 


241 


DQCPKQ I QKGKFQWAI EGLDATGKTTVTQS VADSLKAVLLKS P PSC I GQWRKI FDDE PT 


300 






DQCPKQ I Q KGKFQWA I EGLDATGKTTVTQS VADSLKAVLLKS PPSCI GQWRKI FDDE PT 




Sbjct : 


241 


DQCPKQIQKGKFQWAIEGLDATGKTTVTQSVADSLKAVLLKSPPSCIGQWRKIFDDEPT 


300 


Query : 


301 


IIRRAFYSLGNYIVASEIAKESAKSPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVY 


360 






I IRRAFYSLGNYIVASEIAKESAKSPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVY 




Sbjct: 


301 


IIRRAFYSLGNYIVASEIAKESAKSPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVY 


360 


Query: 


361 


QWPEDLLKPDLILLLTVSPEERLQRLQGRGMEKTREEAELEANSVFRQKVEMSYQRMENP 


420 






QWPEDLLKPDLILLLTVSPEERLQRLQGRGMEKTREEAELEANSVFRQKVEMSYQRMENP 




Sbjct: 


361 


QWPEDLLKPDLILLLTVSPEERLQRLQGRGMEKTREEAELEANSVFRQKVEMSYQRMENP 


420 


Query: 


421 


GCHWDASPSREKVLQTVLSLIQNSFSEP 449 








GCHWDASPSREKVLQTVLSLIQNSFSEP 




Sbjct: 


421 


GCHWDASPSREKVLQTVLSLIQNSFSEP 44 9 





>CRA| 224000047885450 /dataset=GENESEQP /org=Homo sapiens /taxon=9606 
/mol_type=aa /date=18-DEC-03 /length=422 
/altid=derwent_ac | ADC3 9136 /altid=derwent_id | ADC3 9136 
/def=Novel human NOVX polypeptide SEQ ID NO: 78 
/patent=WO2003010327-A2 /pat_section=Claim 
Length = 422 

Score = 842 bits (2174), Expect = 0.0 

Identities = 420/424 (99%), Positives = 420/424 (99%), Gaps = 3/424 (0%) 



Query: 


27 


MAPPCRFVLELPDCTLAHFALGADAPGDADAPDPRLAALLGPPERSYSLCVPVTPDAGCG 


86 






MAPP RFVLELPDCTLAHFALGADAPGDADAPDPRLAALLGPPERSYSLCVPVTPDAGCG 




Sbjct: 


1 


MAPPRRFVLELPDCTLAHFALGADAPGDADAPDPRLAALLGPPERSYSLCVPVTPDAGCG 


60 


Query: 


87 


ARVRAARLHQRLLHQLRRG P FQRCQLLRLLC YC PGGQAGGAQQG FLLRD PLDD PDTRQAL 


146 






ARVRAARLHQRLLHQLRRGPFQRCQLLRLLCYCPGGQAGGAQQGFLLRDPLDDPDTRQAL 




Sbjct : 


61 


ARVRAARLHQRLLHQLRRGPFQRCQLLRLLCYCPGGQAGGAQQG FLLRD PLDD PDTRQAL 


120 


Query: 


147 


LELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQDGRRLQVGCAQWPVPEPPLHPWPD 


206 






LELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQDGRRLQVGCAQWPVPEPPLHPWPD 




Sbjct : 


121 


LELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQDGRRLQVGCAQWPVPEPPLHPWPD 


180 


Query: 


207 


LPSSWFPDREAARAVLEECTSFIPEARAVLDLVDQCPKQI QKGKFQWAI EGLDAT-GK 265 



7 







LPSSWFPDREAARAVLEECTSFIPEARAVLDLVDQCPKQIQKGKFQWAIEGLDAT GK 




Sbjct : 


181 


LPSSWFPDREAARAVLEECTSFIPEARAVLDLVDQCPKQIQKGKFQWAIEGLDATGGK 


240 


Query : 


266 


TTVTQSVADSLKAVLLKSPPSCIGQWRKIFDDEPTIIRRAFYSLGNYIVASEIAKESAKS 


325 






TTVTQSVADSLKAVLLKSPPSCIGQWRKIFDDEPTIIRRAFYSLGNYIVASEIAKESAKS 




Sbjct : 


241 


TTVTQSVADSLKAVLLKSPPSCIGQWRKIFDDEPTI IRRAFYSLGNYIVASEIAKESAKS 


300 




326 


PVIVDRYWHSTATYAIATEVSGGLOHLPPAHHPVYO 


385 






PVIVDR HSTATYAIATEVSGGLQHLPPAHHPVYQWPEDLLKPDLILLLTVSPEERLQR 




Sbjct : 


301 


PVIVDR- -HSTATYAIATEVSGGLQHLPPAHHPVYQWPEDLLKPDLILLLTVSPEERLQR 


358 


Query: 


386 


LQGRGMEKTREEAELEANSVFRQKVEMSYQRMENPGCHVVDASPSREKVLQTVLSLIQNS 


445 






LQGRGMEKTREEAELEANSVFRQKVEMSYQRMENPGCHVVDASPSREKVLQTVLSLIQNS 




Sbjct : 


359 


LQGRGMEKTREEAELEANSVFRQIWEMSYQRMENPGCHVVDASPSREKVLQTVLSLIQNS 


418 


Query : 


446 


FSEP 449 








FSEP 




Sbjct: 


419 


FSEP 422 





>CRA| 98000043611837 /dataset=GENESEQP /org=Mus musculus /taxon=10090 
/mol__type=aa /date=04-DEC-03 /length=395 
/altid=derwent_ac | AAE3 8614 /altid=derwent_id | AAE38614 
/def=Mouse thymidylate kinase protein 
/patent =W02 003 064642 -Al /pat_section=Disclosure ; 
Length =3 95 

Score = 602 bits (1552) , Expect = e-171 

Identities = 294/378 (77%), Positives = 330/378 (87%) 



Query: 


68 


PPERSYSLCVPVTPDAGCGARVRAARLHQRLLHQLRRGPFQRCQLLRLLCYCPGGQAGGA 


127 






P + LCVP+ P GCG RV+AAR+H RLL QLRRGP QRCQL +LL Y PG QAG A 




Sbjct: 


16 


PQGAATRLCVPLAPGEGCGPRVQAARVHHRLLQQLRRGPLQRCQLSKLLGYGPGDQAGEA 


75 


Query: 


128 


QQGFLLRDPLDDPDTRQALLELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQDGRRLQV 


187 






Q GFLLRDP D PDTR+ LL+LLG+CQEA RP L EF+AD +G LWQRLWE+Q R++QV 




Sbjct: 


76 


QHGFLLRDPCDHPDTRRDLLQLLGSCQEAARPQLAEFQADSQGLLWQRLWELQGDRQVQV 


135 


Query: 


188 


GCAQWPVPEPPLHPWPDLPSSWFPDREAARAVLEECTSFIPEARAVLDLVDQCPKQI 


247 






CA V+P EP LHP+ + PDL +S VF DR+AARAVLEECTSFIPEARAVLDLVDQCPK++ 




Sbjct : 


136 


DCACVLPAQE PHLHP LL PDLLNS AV FQDRDAARAVLEE CTS F I P E ARAVLDLVDQC P KE V 


195 


Query: 


248 


QKGKFQWAIEGLDATGKTTVTQSVADSLKAVLLKSPPSCIGQWRKIFDDEPTIIRRAFY 


307 






QKGKFQV+AIEGLDATGKTT+TQSV++SLKAVLL+SPP CI QWRKI FDDE PT 1 1 RRAF Y 




Sbjct: 


196 


QKGKFQVIAIEGLDATGKTTLTQSVSESLKAVLLQSPPPCISQWRKIFDDEPTIIRRAFY 


255 


Query: 


308 


SLGNYIVASEIAKESAKSPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVYQWPEDLL 


367 






SLGNY+VASEIAKES PVIVDRYWHSTATYAIATEVSGGLQ+LPPAHHPVYQWP DLL 




Sbjct: 


256 


SLGNYLVASEIAKESTNFPVIVDRYWHSTATYAIATEVSGGLQYLPPAHHPVYQWPGDLL 


315 


Query: 


368 


KPDLILLLTVSPEERLQRLQGRGMEKTREEAELEANSVFRQKVEMSYQRMENPGCHVVDA 


427 






KPDL+LLLTV+ EER++RLQGRG E KT+ EE AELEAN+ VFRQKVEM+ YQRMENP CH+VDA 




Sbjct: 


316 


KPDLVLLLTWSEERTORLQGRGQEKTKEEAELEANOTFRQKVEMTYQRMENPSCHLVDA 


375 


Query : 


428 


SPSREKVLQTVLSLIQNS 445 








SPSRE VLQ VL LIQ+S 




Sbjct: 


376 


SPSRETVLQKVLELIQSS 3 93 





>CRA| 224000047885452 /dataset=GENESEQP /org=Homo sapiens /taxon=9606 
/mol_type=aa /date=18-DEC-03 /length=251 
/altid=derwent_ac | ADC39138 /altid=derwent_id | ADC39138 
/def=Novel human NOVX polypeptide SEQ ID NO: 80 
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/patent=WO20030103 27-A2 /pat_section=Claim 
Length =251 

Score = 455 bits (1171), Expect = e-127 

Identities = 229/232 (98%), Positives = 229/232 (98%) 

Query: 218 AARAVLEECTSFIPEARAVLDLVDQCPKQIQKGKFQWAIEGLDATGKTTVTQSVADSLK 277 

A AVLEECTSFI PEARAVLDLVDQCPKQIQKGKFQWAI EGLDATGKTTVTQS ADSLK 
Sbjct: 20 ALGAVLEECTSFIPEARAVLDLVDQCPKQIQKGKFQWAIEGLDATGKTTVTQSAADSLK 79 

Query: 278 AVLLKSPPSCIGQWRKIFDDEPTI IRRAFYSLGNYIVASEIAKESAKSPVIVDRYWHSTA 337 

AVLLKSPPSCIGQWRKIFDDEPTIIRRAFYSLGNYIVASEIAKESAKS PVIVDRYWHSTA 
Sbjct: 80 AVLLKSPPSCIGQWRKIFDDEPTIIRRAFYSLGNYIVASEIAKESAKS PVIVDRYWHSTA 139 

Query: 33 8 TYAIATEVSGGLQHLPPAHHPVYQWPEDLLKPDLILLLTVSPEERLQRLQGRGMEKTREE 3 97 

T YAI ATEVSGGLQHLP PAHH PV YQW PEDLL KPDL I LLLTVS PEE RLQRLQGRGME KTRE E 
Sbjct: 140 TYAIATEVSGGLQHLPPAHHPVYQWPEDLLKPDL I LLLTVS PEERLQRLQGRGMEKTREE 199 

Query: 398 AELEANSVFRQKVEMSYQRMENPGCHWDASPSREKVLQTVLSLIQNSFSEP 449 

AELEANSVFRQKVEMSYQRMENPGCHWDASPSREKVLQTVLSLIQNSFSEP 
Sbjct: 200 AELEANSVFRQKVEMSYQRMENPGCHWDASPSREKVLQTVLSLIQNSFSEP 251 



>CRA| 223000060714134 /dataset^GENESEQP /org=Homo sapiens /taxon=9606 
/mol_type=aa /date=26-FEB-04 /length=290 
/altid=derwent_ac|ADF76971 /altid=derwent_id | ADF76971 
/def=Novel human secreted and transmembrane protein 
SeqID 646 /patent=WO2003072035-A2 /pat_section=Claim 
Length = 290 

Score = 441 bits (1133), Expect = e-123 

Identities = 229/252 (90%), Positives = 232/252 (92%), Gaps = 4/252 (1%) 



Query : 


202 


PWPDLPS - S WFPDREAARAVLEECTS F I PEARAVLDLVDQCPKQIQKGKFQWAI 


257 






P+ LPS V +R R CT + I PEARAVLDLVDQCPKQIQKGKFQWAI 




Sbjct : 


39 


PIWATLPSWRVSKRERAIPRRGKSLCTVYLLYI PEARAVLDLVDQCPKQIQKGKFQWAI 


98 


Query: 


258 


EGLDATGKTTVTQS VADS LKAVLL KS P P S C I GQWRKI FDDE PT 1 1 RRAF YS LGNY I VAS E 


317 






EGLDATGKTTVTQS VADS LKAVLL KS P P S C I GQWRKI FDDE PT 1 1 RRAF YS LGNY I V AS E 




Sbjct: 


99 


EGLDATGKTTVTQS VADS LKAVLL KS P P S C I GQWRKI FDDE PT 1 1 RRAF YS LGNY I VAS E 


158 


Query: 


318 


IAKESAKSPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVYQWPEDLLKPDLILLLTV 


377 






IAKESAKSPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVYQWPEDLLKPDLILLLTV 




Sbjct: 


159 


IAKESAKSPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVYQWPEDLLKPDLILLLTV 


218 


Query: 


378 


SPEERLQRLQGRGMEKTREEAELEANSVFRQKVEMSYQRMENPGCHWDASPSREKVLQT 


437 






SPEERLQRLQGRGMEKTREEAELEANSVFRQKVEMSYQRMENPGCHWDASPSREKVLQT 




Sbjct: 


219 


SPEERLQRLQGRGMEKTREEAELEANSVFRQKVEMSYQRMENPGCHWDASPSREKVLQT 


278 


Query : 


438 


VLSLIQNSFSEP 44 9 








VLSLIQNSFSEP 




Sbjct : 


279 


VLSLIQNSFSEP 290 





>CRA| 114000021936825 /dataset=GENESEQP /org=Mus musculus 

/taxon=10090 /mol_type=aa /date=07-MAR- 02 /length=431 
/altid=derwent_ac| ABB57297 /altid=derwent_id | ABB57297 
/def=Mouse ischaemic condition related protein sequence 
SEQ ID NO: 834 /patent=WO200188188-A2 /pat_sect ion=Claim 
Length = 431 

Score = 259 bits (663) , Expect = 2e-68 

Identities = 133/192 (69%), Positives = 152/192 (79%), Gaps = 9/192 (4%) 
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Query: 225 ECTSFIPEARAVLDLVDQCPKQIQKGKFQWAIEGLDATGKTTVTQSVADSLKAVLLKSP 284 

+CTSFIPEARAVLDLVDQCPK++QKGKFQV+AIEGLDATGKTT+TQ + 
Sbjct: 157 QCTSFIPEARAVLDLVDQCPKEVQKGKFQVIAIEGLDATGKTTLTQHFKSLSRLSSYSRH 216 

Query: 285 PSCIGQWRKI FDDEPTI IR RAF YS LGNY I VAS E I AKES AKS PV I VDRYWHSTATY 339 

P CI K + + + ++ F NY+VASEIAKES PV I VDRYWHSTATY 

Sbjct: 217 PPCI KPVEEDLLMMNLLSFEEPFILWANYLVASE I AKESTNFP VI VDRYWHSTATY 272 

Query: 340 AIATEVSGGLQHLPPAHHPVYQWPEDLLKPDLILLLTVSPEERLQRLQGRGMEKTREEAE 399 

AIATEVSGGLQ+LPPAHHPVYQWP DLLKPDL+LLLTV+ EER++RLQGRG EKT+EEAE 
Sbjct: 273 AIATEVSGGLQYLPPAHHPVYQWPGDLLKPDLVLLLTVNSEERVRRLQGRGQEKTKEEAE 332 

Query: 400 LEANSVFRQKVE 411 

LEAN+VFRQKVE 
Sbjct: 333 LEANNVFRQKVE 344 

>CRA| 224000046444908 /dataset=GENESEQP /org=Mus musculus 

/taxon=10090 /mol_type=aa /date=18 -DEC-03 /length=508 
/altid=derwent_ac (ADC63372 /altid=derwent_id | ADC63372 
/def=Mouse thymidylate kinase /patent=US2003087294 -Al 
/pat_section=Disclosure ; 
Length = 508 

Score = 199 bits (506) , Expect = 3e-50 

Identities = 96/120 (80%), Positives = 104/120 (86%) 

Query: 220 RAVLEECTSFIPEARAVLDLVDQCPKQIQKGKFQWAIEGLDATGKTTVTQSVADSLKAV 279 

+ + + CTS FI PEARAVLDLVDQCPK+ +QKGKFQV+ AI EGLDATGKTT+ TQ + 
Sbjct: 97 KTTTQQCTS FI PEARAVLDLVDQCPKEVQKGKFQVI AI EGLDATGKTTLTQHFKSLSRLS 156 

Query: 280 LLKSP PSCIGQWRKI FDDEPTI IRRAFYSLGNYI VASE I AKESAKSPVI VDRYWHSTATY 339 

PSCIGQWRKIFDDEPTIIRRAFYSLGNYIVASEIAKESAKS PV I VDRYWHSTATY 
Sbjct: 157 SYSRHPSCIGQWRKI FDDEPTI IRRAFYSLGNYI VASE I AKESAKS PV I VDRYWHSTATY 216 



Score = 178 bits (452) , Expect = 5e-44 

Identities = 88/100 (88%), Positives = 89/100 (89%) 

Query: 306 FYSLGNYIVASEI AKESAKS PVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVYQWPED 365 

F NY+VASEIAKES PVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVYQWPED 
Sbjct: 269 FILWANYLVASEIAKESTNFPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVYQWPED 328 

Query: 366 LLKPDLILLLTVSPEERLQRLQGRGMEKTREEAELEANSV 405 

LLKPDLILLLTVSPEERLQRLQGRGMEKTREEAE A V 
Sbjct: 329 LLKPDLILLLTVSPEERLQRLQGRGMEKTREEAEAIATEV 368 



Score = 132 bits (333), Expect = 3e-30 
Identities = 63/73 (86%), Positives = 70/73 (95%) 

Query: 340 AIATEVSGGLQHLPPAHHPVYQWPEDLLKPDLILLLTVSPEERLQRLQGRGMEKTREEAE 399 

AIATEVSGGLQ+LPPAHHPVYQWP DLLKPDL+LLLTV+ EER++RLQGRG EKT+EEAE 
Sbjct: 414 AIATEVSGGLQYLPPAHHPVYQWPGDLLKPDLVLLLTVNSEERVRRLQGRGQEKTKEEAE 473 

Query: 400 LEANSVFRQKVEM 412 

LEANSVFRQKVE + 
Sbjct: 474 LEANSVFRQKVEL 486 



\ 
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Score = 120 bits (300), Expect = 2e-26 

Identities = 64/73 (87%), Positives = 66/73 (90%), Gaps = 1/73 (1%) 

Query: 225 ECTSFIPEARAVLDLVDQCPKQIQKGKFQWAIEGLDATGKTTVTQSVADSLKAVLLKSP 284 

ECTS F I PE ARAVLDLVDQCPKQ I QKGKFQWAI EGLDATGKTTVTQSVADS LKAVLLKS P 
Sbjct: 1 ECTSFIPEARAVLDLVDQCPKQIQKGKFQWAIEGLDATGKTTVTQSVADSLKAVLLKSP 60 

Query: 285 -PSCIGQWRKIFD 296 

S I + R + D 
Sbjct: 61 CTS FI PEARAVLD 73 



Database: geneseqp_all . f asta 

Posted date: Feb 27, 2004 4:29 PM 
Number of letters in database: 273,213,384 
Number of sequences in database: 1,546,707 

Lambda K H 

0.321 0.138 0.426 

Gapped 

Lambda K H 

0.267 0.0410 0.140 



Matrix: BLOSUM62 

Gap Penalties: Existence: 11, Extension: 1 

Number of Hits to DB: 273,243,944 

Number of Sequences: 1546707 

Number of extensions: 12474211 

Number of successful extensions: 32890 

Number of sequences better than 1.0e-08: 9 

Number of HSP's better than 0.0 without gapping: 9 

Number of HSP's successfully gapped in prelim test: 0 

Number of HSP's that attempted gapping in prelim test: 

Number of HSP's gapped (non-prelim): 17 

length of query: 44 9 

length of database: 273,213,384 

effective HSP length: 119 

effective length of query: 330 

effective length of database: 89,155,251 

effective search space: 29421232830 

effective search space used: 29421232830 



32867 



T: 

A: 

XI: 

X2: 

X3 : 

SI: 

S2: 
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40 
16 
38 
64 
41 



( 7 
(14 
(24 
(21 



4 bits) 

6 bits) 

7 bits) 

8 bits) 



148 (61.6 bits) 



NCBI pataa Blast Report : 
BLASTP 2.2.5 [Nov-16-2002] 

Reference: Altschul, Stephen F. , Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997) , 
"Gapped BLAST and PSI -BLAST : a new generation of protein database search 
programs", Nucleic Acids Res. 25:3389-3402. 

Query= CL001305, SEQ ID N0:2, March 3, 2004 
(449 letters) 
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Database : pataa . f asta 

143,807 sequences; 24,539,066 total letters 

Score E 

Sequences producing significant alignments: (bits) Value 

CRA|224000047562998 /altid=gi | 27309063 /def =gb | AAO00504 . 1 | Seque . . . 914 0.0 
CRA|224000047566517 /altid=gi j 27309064 /def =gb j AAO00505 . 1 j Seque... 199 3e-51 

>CRA| 224000047562998 /altid=gi | 27309063 /def =gb | AAO00504 . 1 | Sequence 
2 from patent US 6489153 /org= /taxon= /div= 
/dataset=pataa /length=449 
Length =44 9 

Score = 914 bits (2362), Expect = 0.0 

Identities = 449/449 (100%), Positives = 449/449 (100%) 



Query : 


1 


MAFARRLLRGPLSGPLLGRRGVCAGAMAPPCRFVLELPDCTLAHFALGADAPGDADAPDP 


60 






MAFARRLLRGPLSGPLLGRRGVCAGAMAPPCRFVLELPDCTLAHFALGADAPGDADAPDP 




Sbjct : 


1 


MAFARRLLRGPLSGPLLGRRGVCAGAMAPPCRFVLELPDCTLAHFALGADAPGDADAPDP 


60 


Query: 


61 


RLAALLGPPERSYSLCVPVTPDAGCGARVRAARLHQRLLHQLRRGPFQRCQLLRLLCYCP 


120 






RLAALLGPPERSYSLCVPVTPDAGCGARVRAARLHQRLLHQLRRGPFQRCQLLRLLCYCP 




Sbjct: 


61 


RLAALLGPPERSYSLCVPVTPDAGCGARVRAARLHQRLLHQLRRGPFQRCQLLRLLCYCP 


120 


Query : 


121 


GGQAGGAQQGFLLRDPLDDPDTRQALLELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQ 


180 






GGQAGGAQQGFLLRDPLDDPDTRQALLELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQ 




SbjCt: 


121 


GGQAGGAQQGFLLRDPLDDPDTRQALLELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQ 


180 


Query: 


181 


DGRRLQVGCAQWPVPEPPLHPWPDLPSSWFPDREAARAVLEECTSFIPEARAVLDLV 


240 






DGRRLQVGCAQWPVPEPPLHPWPDLPSSWFPDREAARAVLEECTSFIPEARAVLDLV 




Sbjct: 


181 


DGRRLQVGCAQWPVPEPPLHPWPDLPSSWFPDREAARAVLEECTSFIPEARAVLDLV 


240 


Query : 


241 


DQCPKQIQKGKFQWAIEGLDATGKTTVTQSVADSLKAVLLKSPPSCIGQWRKIFDDEPT 


300 






DQCPKQIQKGKFQWAIEGLDATGKTTVTQSVADSLKAVLLKSPPSCIGQWRKIFDDEPT 




Sbjct : 


241 


DQCPKQIQKGKFQWAIEGLDATGKTTVTQSVADSLKAVLLKSPPSCIGQWRKIFDDEPT 


300 


Query: 


301 


IIRRAFYSLGNYIVASEIAKESAKSPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVY 


360 






IIRRAFYSLGNYIVASEIAKESAKSPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVY 




Sbjct: 


301 


IIRRAFYSLGNYIVASEIAKESAKSPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVY 


360 


Query: 


361 


QWPEDLLKPDLI LLLTVS PEERLQRLQGRGMEKTREEAELEANSVFRQKVEMS YQRMENP 


420 






QWPEDLLKPDLI LLLTVS PEERLQRLQGRGMEKTREEAELEANSVFRQKVEMSYQRMENP 




Sbjct: 


361 


QWPEDLLKPDLI LLLTVS PEERLQRLQGRGMEKTREEAELEANSVFRQKVEMS YQRMENP 


420 


Query: 


421 


GCHWDASPSREKVLQTVLSLIQNSFSEP 44 9 








GCHWDASPSREKVLQTVLSLIQNSFSEP 




Sbjct: 


421 


GCHWDASPSREKVLQTVLSLIQNSFSEP 449 





>CRA| 224000047566517 /altid=gi | 27309064 /def =gb | AAO00505 . 1 | Sequence 
4 from patent US 6489153 /org= /taxon= /div= 
/dataset=pataa /length=508 
Length = 508 

Score = 199 bits (506), Expect = 3e-51 

Identities = 96/120 (80%), Positives = 104/120 (86%) 

Query: 220 RAVLEECTSFIPEARAVLDLVDQCPKQIQKGKFQWAIEGLDATGKTTVTQSVADSLKAV 279 

+ ++ CTSFI PEARAVLDLVDQCPK+ +QKGKFQV+ AI EGLDATGKTT+TQ + 
Sbjct: 97 KTTTQQCTSFIPEARAVLDLVDQCPKEVQKGKFQVIAIEGLDATGKTTLTQHFKSLSRLS 156 

Query: 280 LLKSPPSCIGQWRKI FDDEPTI IRRAFYSLGNYIVASEIAKESAKSPVIVDRYWHSTATY 339 
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PSC I GQWRKI FDDE PT 1 1 RRAF YSLGN Y I VAS E I AKES AKS PV I VDR YWHS TAT Y 
Sbjct: 157 SYSRHPSCIGQWRKIFDDEPTIIRRAFYSLGNYIVASEIAKESAKSPVIVDRYWHSTATY 216 



Score = 178 bits (452), Expect = 5e-45 

Identities = 88/100 (88%), Positives = 89/100 (89%) 

Query: 3 06 FYSLGNYIVASEIAKESAKS PVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVYQWPED 365 

F NY+VASEIAKES PVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVYQWPED 
Sbjct: 269 FILWANYLVASEIAKESTNFPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVYQWPED 328 

Query: 366 LLKPDLILLLTVSPEERLQRLQGRGMEKTREEAELEANSV 405 

LLKPDLILLLTVSPEERLQRLQGRGMEKTREEAE A V 
Sbjct: 329 LLKPDLILLLTVSPEERLQRLQGRGMEKTREEAEAIATEV 368 



Score = 132 bits (333), Expect = 3e-31 
Identities = 63/73 (86%), Positives = 70/73 (95%) 

Query: 340 AIATEVSGGLQHLPPAHHPVYQWPEDLLKPDLILLLTVSPEERLQRLQGRGMEKTREEAE 399 

AIATEVSGGLQ+LPPAHHPVYQWP DLLKPDL+LLLTV+ EER++RLQGRG EKT+EEAE 
Sbjct: 414 AIATEVSGGLQYLPPAHHPVYQWPGDLLKPDLVLLLTVNSEERVRRLQGRGQEKTKEEAE 4 73 

Query: 400 LEANSVFRQKVEM 412 

LEANSVFRQKVE+ 
Sbjct: 474 LEANSVFRQKVEL 486 



Score = 120 bits (300), Expect = 2e-27 

Identities = 64/73 (87%), Positives = 66/73 (90%), Gaps =» 1/73 (1%) 

Query: 225 ECTSFIPEARAVLDLVDQCPKQIQKGKFQWAIEGLDATGKTTVTQSVADSLKAVLLKSP 284 

ECTSFI PE ARAVLDLVDQCP KQ I QKGKFQ WAI EGLDATGKTTVTQS VADS LKAVLLKS P 
Sbjct: 1 ECTSF I PEARAVLDLTOQCPKQ I QKGKFQWAI EGLDATGKTTVTQS VADS LKAVLLKS P 60 

Query: 285 -PSCIGQWRKIFD 296 

S I + R + D 
Sbjct: 61 CTSFIPEARAVLD 73 



Database: pataa.fasta 

Posted date: Nov 30, 2003 1:29 AM 
Number of letters in database: 24,539,066 
Number of sequences in database: 143,807 

Lambda K H 

0.321 0.138 0.426 

Gapped 

Lambda K H 

0.267 0.0410 0.140 



Matrix: BLOSUM62 

Gap Penalties: Existence: 11, Extension: 1 

Number of Hits to DB: 23,728,226 

Number of Sequences: 143807 

Number of extensions: 1055474 

Number of successful extensions: 2885 

Number of sequences better than 1.0e-08: 2 
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Number 
Number 
Number 
Number 
length 
length 
effect 
effect 
effect 
effect 
effect 

T: 11 
A: 40 
XI: 16 
X2: 38 
X3: 64 
SI: 41 
S2: 14 



of HSP's better than 0.0 without gapping: 2 

of HSP's successfully gapped in prelim test: 0 

of HSP's that attempted gapping in prelim test: 

of HSP's gapped (non-prelim) : 9 

of query: 44 9 

of database: 24,539,066 

ive HSP length: 103 

ive length of query: 346 

ive length of database: 9,726,945 

ive search space: 3365522970 

ive search space used: 3365522970 



2873 



( 7.4 bits) 
{14 .6 bits) 
(24 .7 bits) 
(21.8 bits) 
0 {58.5 bits) 
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